NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Scheduling Parallel-Task Jobs Subject to Packing and Placement Constraints

https://doi.org/10.1287/opre.2021.2198

Shafiee, Mehrnoosh; Ghaderi, Javad (January 2022, Operations Research)

Motivated by modern parallel computing applications, we consider the problem of scheduling parallel-task jobs with heterogeneous resource requirements in a cluster of machines. Each job consists of a set of tasks that can be processed in parallel; however, the job is considered completed only when all its tasks finish their processing, which we refer to as the synchronization constraint. Furthermore, assignment of tasks to machines is subject to placement constraints, that is, each task can be processed only on a subset of machines, and processing times can also be machine dependent. Once a task is scheduled on a machine, it requires a certain amount of resource from that machine for the duration of its processing. A machine can process (pack) multiple tasks at the same time; however, the cumulative resource requirement of the tasks should not exceed the machine’s capacity. Our objective is to minimize the weighted average of the jobs’ completion times. The problem, subject to synchronization, packing, and placement constraints, is NP-hard, and prior theoretical results only concern much simpler models. For the case that migration of tasks among the placement-feasible machines is allowed, we propose a preemptive algorithm with an approximation ratio of [Formula: see text]. In the special case that only one machine can process each task, we design an algorithm with an improved approximation ratio of four. Finally, in the case that migrations (and preemptions) are not allowed, we design an algorithm with an approximation ratio of 24. Our algorithms use a combination of linear program relaxation and greedy packing techniques. We present extensive simulation results, using a real traffic trace, that demonstrate that our algorithms yield significant gains over the prior approaches.
more » « less
Full Text Available
Scheduling Coflows With Dependency Graph

https://doi.org/10.1109/TNET.2021.3116133

Shafiee, Mehrnoosh; Ghaderi, Javad (October 2021, IEEE/ACM Transactions on Networking)
null (Ed.)
Full Text Available
On Max-Min Fairness of Completion Times for Multi-Task Job Scheduling

Shafiee, Mehrnoosh; Ghaderi, Javad (July 2020, 2020 IFIP Networking Conference (Networking))

We study the max-min fairness of multi-task jobs in distributed computing platforms. We consider a setting where each job consists of a set of parallel tasks that need to be processed on different servers, and the job is completed once all its tasks finish processing. Each job is associated with a utility which is a decreasing function of its completion time, and captures how sensitive it is to latency. The objective is to schedule tasks in a way that achieves max-min fairness for jobs' utilities, i.e., an optimal schedule in which any attempt to improve the utility of a job necessarily results in hurting the utility of some other job with smaller or equal utility. We first show a strong result regarding NP-hardness of finding the max-min fair vector of job utilities. The implication of this result is that achieving max-min fairness in many other distributed scheduling problems (e.g., coflow scheduling) is NP-hard. We then proceed to define two notions of approximation solutions: one based on finding a certain number of elements of the max-min fair vector, and the other based on a single-objective optimization whose solution gives the max-min fair vector. We develop scheduling algorithms that provide guarantees under these approximation notions, using dynamic programming and random perturbation of tasks' processing times. We verify the performance of our algorithms through extensive simulations, using a real traffic trace from a large Google cluster.
more » « less
Full Text Available
On Max-Min Fairness of Completion Times for Multi-Task Job Scheduling

Shafiee, Mehrnoosh; Ghaderi, Javad (July 2020, IFIP Networking Conference)

We study the max-min fairness of multi-task jobs in distributed computing platforms. We consider a setting where each job consists of a set of parallel tasks that need to be processed on different servers, and the job is completed once all its tasks finish processing. Each job is associated with a utility which is a decreasing function of its completion time, and captures how sensitive it is to latency. The objective is to schedule tasks in a way that achieves max-min fairness for jobs’ utilities, i.e., an optimal schedule in which any attempt to improve the utility of a job necessarily results in hurting the utility of some other job with smaller or equal utility.We first show a strong result regarding NP-hardness of finding the max-min fair vector of job utilities. The implication of this result is that achieving max-min fairness in many other distributed scheduling problems (e.g., coflow scheduling) is NP-hard. We then proceed to define two notions of approximation solutions: one based on finding a certain number of elements of the max-min fair vector, and the other based on a single-objective optimization whose solution gives the max-min fair vector. We develop scheduling algorithms that provide guarantees under these approximation notions, using dynamic programming and random perturbation of tasks’ processing times. We verify the performance of our algorithms through extensive simulations, using a real traffic trace from a large Google cluster.
more » « less
Full Text Available
An Improved Bound for Minimizing the Total Weighted Completion Time of Coflows in Datacenters

https://doi.org/10.1109/TNET.2018.2845852

Shafiee, Mehrnoosh; Ghaderi, Javad (August 2018, IEEE/ACM Transactions on Networking)

Full Text Available
A Simple Congestion-Aware Algorithm for Load Balancing in Datacenter Networks

https://doi.org/10.1109/TNET.2017.2751251

Shafiee, Mehrnoosh; Ghaderi, Javad (December 2017, IEEE/ACM Transactions on Networking)

Full Text Available
Brief Announcement: A New Improved Bound for Coflow Scheduling

https://doi.org/10.1145/3087556.3087598

Shafiee, Mehrnoosh; Ghaderi, Javad (January 2017, Brief Announcement: A New Improved Bound for Coflow Scheduling)

Full Text Available

Search for: All records